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1. INTRODUCTION 

Successful modeling and prediction of processes peculiar to complex systems, such as 
ecological, social, and economical (ESE) ones, remain one of the most relevant problems as 
applied to the whole complex of natural, human and social sciences ([l]-[6]). 

The diversity of possible approaches to modeling such systems and, usually, more than 
modest success in the dynamics prediction, compel us to look for the reasons of failure, find- 
ing them not only in details, but also in the axiomatics, which relates to problem statement, 
chosen modeling methods, results interpretation, connections with other scientific directions. 

With the appearance of quantum mechanics and relativity theory in early twentieth 
century new philosophical ideas on physical values, measuring procedures and system state 
have been established, the ones that are completely different from Newtonian notions [T] [S] . 

For more than 70 years basic concepts of classical and neoclassical economic theories have 
been discussed by leading scientists, generating new approaches [9j. The general systems 
theory has acquired recognition in the middle of the 20th century giving way to development 
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of the new, systemic, emergent, and quantum in essence approach to investigation of complex 
objects, which postulates the limited nature of any kind of modeling and is based upon fixed 
and closed system of axioms [TU] . 

However, the development of this new philosophical basis of ESE systems modeling is 
still accompanied with numerous difficulties, and new principles are often merely declared. 

Current research is devoted to investigation and application of the new modeling and 
prediction technology, suggested in [HI [12], based on concepts of determined chaos, complex 
Markov chains and hierarchic (in terms of time scale) organization of calculating procedures. 

2. ANALYSIS OF PROMINENT PUBLICATIONS RELEVANT TO THE 

SUBJECT 

Prediction of financial-economic time series is an extremely urgent task. Modern ap- 
proaches to the problem can be characterized by the following directions: 1) approximation 
of a time series using an analytical function and extrapolation of the derived function to- 
wards future - so-called trend models jl3]; 2) investigation of the possible infiuence various 
factors might have on the index, which is being predicted, as well as development of econo- 
metric or more complicated models using the Group Method of Data Handling (GMDH) 
[31 HI]; 3) modeling future prices as the decisions-making results using neuronal networks, 
genetic algorithms, fuzzy sets [TIHTG] . 

Unfortunately, these techniques don't produce stable forecasts, what can be explained by 
complexity of the investigated systems, constant changes in their structure. Although we 
are trying to join these directions in one algorithm, it is the latter option that we prefer, 
with it consisting in creating a model adequate to the process generating a price time series 
|17j . This very approach gives a chance to approach the complexity of the system, which 
generates the observed series, develop the model and use its properties as the prognosis. 

3. AIMS OF THE PAPER, PROBLEM STATEMENT 

Assume the time series is set by a sequence of discrete levels with constant step of time 
sampling At. We need to generate variants of the time series continuation (prognosis sce- 
narios) according to the relations between the sequences of absolute and relative changes 
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discovered with the help of complex Markov chains. 

4. CLASSICAL MODELING PROBLEMS OF ESE SYSTEMS DYNAMICS 

Another peculiar feature of ESE systems, apart from complexity, is a memory, including 
the long-term one, as well as nonlinear and unstable nature of interactions and components, 
which makes it harder to predict their future behavior. 

Unfortunately, mathematical models based on differential equations have no memory 
(there is no aftereffect), while for models with memory, where integral interrelations are 
used, it is not always possible to take into account nonhnearity (the integration procedure 
is linear by definition). 

In reality, in the Cauchy problem future systems behavior is defined by its initial state 
and doesn't depend on the way the system reached its current state. However, it is hardly 
true that future behavior of a real socio-economic or socio-ecological system can be predicted 
by giving an immediate time "slice" of a variables set that describe its state. 

Let us consider possible ways to take into account past events while modeling ESE sys- 
tems' dynamics, which goes beyond the boundaries of classical differential and integral equa- 
tions. 

Functional differential lagging equation can serve as a simple example of the dynamic 
model with memory, where present time is defined by the state variable x{t) and depends 
on the past state x{t — t) with constant time lag r = const: 



where f{x) is the known function, with initial conditions being set for the half-interval 
to — T < t < to by the function (f){t): 




(1) 



x(t) = 4>(t); to - T <t <to. 



(2) 



Given the [2] equation [T] has the only solution, defined by recurrent ratios: 



/(0(t-r)); if to <t <t + r; 



x{t) = < 



f if {4>{t - r))) ; ifto + r<t<t + 2T; 



(3) 



/((/(0(t-r)))); iito + 2T<t<t + 3T- 
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Using Dirac delta function, as defined by ratios: 

+00 

6{t) = 0, if X ^ 0; J 5{t)dt = 1, (4) 
—00 

we can formally rewrite equation [T] in the integral form: 

t 

x{t) = j dtj Hiti,ty, Hit,,t) = 5it^-it-r));t> t^. (5) 

—00 

Delta function is not a function in the conventional interpretation and is related to the 
class of generalized functions that were mathematically described only in the middle of 
the last century [TH] (physics started using this function much earlier). Its classical form is 
considered to be a limit of the "peak" sequence, with its centre set in the point of origin. The 
afore-mentioned "peaks" indefinitely converge widthway, indefinitely increase throughout the 
height and have a unit area. 

An approximate classic integral analogue of the equation [5] can be derived by substituting 
6{t) with an ordinary function - some specific narrow enough "peak" of a unit area, a certain 
finite width ^ At as well as a finite height . The derivative of the Fermi function is one of 
the possible examples: 

'^(t) = \ S(t) ^ —. T-^. (6) 

l + exp{^)'^^ e {2 + exp {^) + exp (D) 

If the system's state in the moment t, x{t), is defined not by one, as in [l| but k {k = 
2, 3, 4, ■ ■ ■ ) of her past states x{t — ti), x{t — T2), ■ ■ ■ x{t — r^) in the following moments of 
time (t — Ti), (t — T2), ■ ■ ■ ,{t — Tfc) respectively {ji = const, T2 = const, ■ ■ ■ ,Tk = const, ti > 
T2 > ■ ■ ■ > Tk > 0), then instead of Q, ^ we get: 

x(t) = f {x(t - Ti); xit-T2);-- - ; x{t - Tk)) ; t > to; (7) 



x{t) = <p{t);tQ -ri<t<to 



(8) 



t t t 
x{t) = J dti J dt2--- J dtkf ix{ti),x{t2),- ■ ■ ,x{tk)). 

—00 —00 —00 

6 ((ti - (t - n)) 6 {{t2 - (t - rs)) ■ ■ ■ 5 {{tk -{t- Tk)) ■,t>to. 



(9) 
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Therefore if the system's state in the time moment t depends on the infinite sequence 
of its past states, the integral analogue of the functional differential lagging equation will, 
generally speaking, contain an integral of the infinite multiplicity. At the same time the 
infinite amount of past states can relate to both finite {t — Ti;t) (short-term memory) and 
infinite (—00; t) (long-term memory) time span. 

Pay attention that the classic integral lagging equation is one of the Volterra type [19] : 

t 

.W.jF(.itM (10) 

—00 

where F (x(f);t;t) - is an arbitrary (generally nonlinear) function of variables x{i);t;i, 
which allows to take into account system's memory of its past states only in the additive 



approximation, which becomes evident, if the right section 10 is rewritten in the following 
way: 

t t _ *i 

/ F {x{i); t;t)di= J F (x(t); t;t)dt + J F {x{i); t;i)di+-- - = 

— 00 tl t2 

F (x(fi); t; fi) ■{t-ti) + F (^(fa); t; £2) ■ (ti - ta) + ■ ■ ■ ; ^^^^ 

t > tl > t2 > ■ ■ ■ -,£1 e [ti,t] -,{2 e [t2, tl] ; ■ ■ ■ 

In connection with it note that the equation |9] in case of an additive dependency of 
contemporaneity on the past, i.e. in case: 

/ {x{t - n); x(t - rs); • ■ ■ ) = /i {x{t - n)) ; ^ {x{t - T2))) ; • • • (12) 



becomes a particular case of the equation 10 with the following integrand: 



F {x{iy, t; t) = A [xii)) 6 {i -{t- n)) + f2 {x{i)) 6 {i - (t - T2)) + ■ ■ ■ (13) 

Meaningful analysis of nonlinear models dynamics with memory, in which the future is 
defined by the infinite amount of states in the past is generally possible only in case of a 
discrete representation. The results of such analysis will be approximated, i.e. will contain 
uncertainty, which has to be considered endogenous, i.e. internal, and peculiar to this very 
system. 

With a certain level of time sampling, models with memory both [7] and 10 becomes: 



x{n + 1) = f {x{n); x{n — 1); x{n — 2) ■ ■ ■ ) 



(14) 
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To take into account and quantify the uncertainties, observed in ESE as well as other 
complex systems probability models are normally used. However their application is based 
on doubtful hypotheses, while the statistical interpretation of the results is not always infor- 
mative enough and results might not correspond with the real process occurring within the 
system. In particular, the well-known problem of l//-noise (look for example [20]), closely 
connected to the presence of long-term memory in complex systems, implies the absence of 
the mean temporary value (as a limit of a certain time span converging to infinity, which 
serves as the basis for averaging) for any process occurring in such kind of system. Therefore 
such processes can't have a rigorous statistical substantiation. 



5. MODERN CONCEPTS IN ESE SYSTEMS MODELING 

New approaches to modeling and prediction of complex nonlinear systems dynamics with 
memory are based on the use of determined chaos and neural networks technologies (cf. 
e.g. [ISl EH |22]). Both investigation and realization of such techniques has become possible 
only with the appearance of quick-operating computers. Use of the recurrent computational 
process has become the general feature for all these technologies: 

Xn+l = fn ifn-l (" " " {fl{Xl) ■■■))), U = 1, 2, ■■■ , (15) 

where fi{xi) is a certain nonlinear mapping of a multi-dimensional vector Xj, i - discrete. 



real or fictitious, time. Identification of the model 15 is reduced to the determination of 
functions fi{xi), while the differences between the models of determined chaos and neural 
networks are connected with the function type and methods of its definition (neural net- 
work models normally use rather narrow class of fi{xi) mappings [16j). Generally speaking, 
stability or convergence of the process [15] is not required, whereas a single-step set of vector 



Xi components as well as their time dynamics can be of great interest. 



For the particular case of the model 15, introducing corresponding lagged variables, a 
model [14] can be transformed. 

Both determined stable processes, described by integro-differential equations, and random 
processes, which also include complex Markov chains (CMC), can be formally considered 



as separate extreme cases of determined chaos models realization 15 Given the sampling 



scale, which tends to zero, if such a tendency makes sense and corresponding limits exist, we 
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derive classical differential and integral problem statement. Finite At allows to get models 
with discrete time, which in the general case in the corresponding phase space (which also 
includes lagged variables) can produce both measurable sets (discrete or continuous) that 
allow probabilistic interpretation and those of the special structure - fractals p3j , that can't 
be always interpreted in that way. 

Various digital generators of so-called random sequences used in imitational modeling 
can be an example of determined chaos models that allow probabilistic interpretation. 

Let us note that in reality there are no accurate procedures that would give an opportunity 
to distinguish a "real" random sequence from the pseudorandom one. 

6. MARKOV CHAINS PREDICTION TECHNOLOGY 

Suppose there is a sequence of a certain system discrete states. From this sequence we 
can determine transitions probabilities between the two states. Simple Markov chain is a 
random process, in which the next state probability depends solely on the previous state 
and is independent from the rest of them. Complex Markov chain, unlike the simple one, 
stands for the random process, in which the next state probability depends not only on the 
current, but also on the sequence of several previous states (history). The amount of states 
in history is the order of the Markov chain. 

Theory of simple Markov chains is widely presented in literature, for example [21]. As 
for the high order Markov chains, modern literature [25] can offer us a mere definition. 
Developing complex or high order Markov chain's properties is not widely presented in 
modern scientific publications. It's necessary to mention the papers [261 [2l] where properties 
of complex Markov Chains are developed, but no prediction algorithm is proposed there. 
The development of prediction method, based on complex Markov chains, is proposed in 
this paper. 

Markov chain of the higher order can be brought to a simple Markov chain by introducing 
the notion of a "generalized state" and including a series of consequent system's states into 
it. In this case, tools of simple Markov chains can be applied to the complex ones. 

Investigated dynamic series is a result of a certain process. It is assumed that this process 
is determined, which implies the existence of a causal dependence of further states on history. 
It is impossible to fix and analyze the infinite history, which puts obstacles in the way of an 
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accurate detection of this influence and making precise predictions. 

The problem consists in the maximal use of information, which is contained in the known 
segment of the time series, and subsequent modeling of the most probable future dynamics 
scenario. 

The observed process is described as a time series of prices pt with the given sampling 
time span At 

Pu=p{to + iAt). (16) 

Discrete presentation of the time series is in fact a way of existence of this very system. 
New prices are formed on the basis of contracts or deals, made on the market in certain 
discrete moments of time, while the price time series is a series of the averaged price levels 
during the chosen time intervals. While making a decision each trader, who is an active 
part of the pricing system, works solely with discrete series of the chosen time interval (e.g. 
minute, 5-minute, hourly, daily etc.). For At — t- the accuracy of data presentations reaches 
a certain limit, since for relatively small At the price leaps in the moment of deal, while 
staying unchanged and equal to the last deal during the time between the two deals. Hence, 
the discreteness of time series has to be understood not only as a limited presentation of 
activity of the complex flnancial system, but also as one of the principles of its operation 

[iiiiTaEHHan]. 

The time series of initial conditions has to be turned into a sequence of discrete states. 
Let us denote the amount of chosen states as s, each of them being connected to the change 
in the quantity of the initial signal (returns). For example, consider the classiflcation with 
two states, flrst of which corresponds to positive returns as the price increases, while the 
second one - to negative as it descends. Generally all possible increments of the initial time 
series are divided into s groups. Ways of division will be discussed further. 

Next we develop predictions for the time series of sampled states. For the given order 
of the Markov chain and the last generalized state the most probable state is chosen to 
be the next one. In case if ambiguity occurs while the state of maximum probability is 
being evaluated, an algorithm is used that allows reducing the amount of possible prediction 
scenarios. Therefore we get the series of predicted states that can be turned into a sampled 
sequence of prognostic values. 

Evaluation of increments, prediction, and subsequent restoration are conducted for the 
given hierarchy of time increments t. To use the given information as effectively as possible, 
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the prediction is conducted for time increments t = 1, 2, 4, 8, or a more complex hierarchy 
of increments and subsequent "sphcing" of the results derived from different prediction 
samplings. 

The procedure of prediction and splicing is iterative and conducted starting from smaller 
increments, adding a prediction with the bigger time increment on every step. 

As the sampling time step t increases, the statistics for the investigation of Markov chains 
decreases, whereas the biggest sampling step, which takes part in the prognostication, limits 
itself. To supplement the prediction with the low-frequency component the approximation 
of zero order is being used in the form of a linear trend or a combination of a linear trend 
and harmonic oscillations [3T1 132] . 

7. PREDICTION CONSTRUCTION ALGORITHM 

Let us consider the consequence of operations, required for the prognostic time series 
construction. To do this we need to set the following parameters: 

1) The type of time increments hierarchy (simple - powers of two, complex - product of 
powers of the first simple numbers). 

2) Values of s - the amount of states and r - the order of the Markov chain. These 
parameters can be individual for every sampling level; finding of optimal parameters is done 
experiment ally. 

3) Threshold values S, and minimal number of transitions Nmin- 
Prediction construction algorithm includes the following steps: 

1) Generating hierarchy of time increments - t sequence. The maximal of them has to 
correspond to the length of a prognostic interval N^ax- 

2) For every time increment At, as the increments increase, a prediction of states and 
restoration of the time series along the prognostic states is conducted. Current stage includes 
following actions: 

2.1. Evaluating increments (returns) of the series with At sampling. 

2.2. Transforming the time series of increments into the series of state numbers (l..s). 

2.3. Calculating transition probabilities for generalized states. 

2.4. Constructing the series of prognostic states using the procedure of defining the most 
probable next state. 
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2.5. Restoring the value series from the state series with At samphng. 

2.6. Sphcing the prediction of At samphng with the time series derived from sphcing of 
the previous layers (with the lesser step At). In case if the current time series is the first 
one, the unchanged time series will come as a result of splicing. 

3) To splice the last spliced time series with the continuation of the linear trend, created 
along all previously known points. 

The time series, spliced with the linear trend, is the result of prediction. Let us consider 
the stages of the given algorithm in detail. 



8. STATES IN COMPLEX MARKOV CHAINS AND APPROACHES FOR 

DEFINING THEM 

In everything that concerns current technology, states are connected to the measuring of 
a prognostic value. There is a number of ways to classify returns in states, from which the 
following are suggested. One of them is the classification based on the homogeneity principle 
as concerning the amount of representatives in classes; based on the homogeneity principle 
of deviation, as well their combinations for different deviation modules. 

Increment or returns of the time series serves as the basis for states classification [32l 133] . 
Absolute Ta and relative increments of the time series are considered: 

ra=Pt- Pt-At, (17) 



Pt - Pt^At „v 

rt = , (18) 

Pt 

where pt - is the input time series of price dynamics. At - sampling interval, which is 
chosen for subsequent analysis. It is known that mathematical expectation of the returns 
time series equals zero, whereas variation comes as the measure of time series volatility. 
Based on returns values classification and transformation of values to the time series of 
discrete states are conducted. One of the classification principles is homogeneity according to 
the amount of class representatives. This classification divides the set of all increments into s 
groups equal in number. Calculated with the given sampling, time series increments are then 
systematized in growth and divided into equal parts. Thus we define limit values {rum^i}, 
which are used afterwards during transformation of the returns into class numbers. Large 
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number of identical states can cause certain problems, such as identical bounds of several 
neighbouring states. It creates a number of states with no representatives, which makes 
correction of the division a necessary action. In that way we will reach the largest possible 
homogeneity in state division. Classification is conducted along the following algorithm 



where St is the number of state, which corresponds to the moment of time t, for which the 
returns level was computed r^; i is the number of state [1 . . . s], which is characterized by 
the interval [riim,i-i,riim,i] corresponding to the calculated returns level r^. 

Apart from the returns interval, given by the aforementioned values [riim,i-i,i^iim,i], a 
mean returns value is chosen for every state ravg,i, which will be used in time series values 
transformation according to the prognostic discrete states. 

Another way of dividing the time series into states implies dividing the interval of returns 
values into equal parts, from minimal to maximal deviation. In this case homogeneity 
according to the amount of representatives in states does not occur. In fact this method 
differs from the previous one in terms of defining limit values {riim,i}- Possible combined 
ways of division, in case of which the limit value, dependent on standard deviation, is used 
instead of maximal and minimal value, and division is conducted homogenously according 
to the deviation. 

Since the real causal dependence is unknown during the process, to find it adequate 
state classification, which would allow to reveal vital dependencies of the time series, is 
required. We suggest a couple of ways to divide the time series into states, which in the first 
place allow to preserve adequate transition probabilities between states, as well as prevent 
averaged deviations inside the states from affecting the accuracy of the derived prediction. 

To check the efficiency of division we conduct the sampling procedure and classify the 
increments according to each hierarchy. Having completed that, we restore the time series 
using known states for each hierarchy and finish the splicing procedure. Since the state series 
correspond to the initial time series, we get the curve, with deviation, caused exclusively 
by the state averaging mistake (quantum mistake). Thus, having set a certain value of 
state numbers s and carried out sampling, restoring, and sampling procedures (excluding 
prediction), we get absolute sampling (quantum) mistake. 



[MIES]: 




(19) 
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Increasing the number of states, we improve the accuracy of restoration, however one 
should remember, that the choice of the quantum levels is limited by the fact that the 
transition probabilities definition with sufficient accuracy is required, which is confirmed by 
artificial test time series prediction experiments. 

9. STEP-BY-STEP PREDICTION PROCEDURE. DEFINING THE MOST 
PROBABLE STATE ON THE NEXT STEP, PREDICTION SCENARIOS 

Predicting procedure uses the most probable state as the next one under current circum- 
stances. Probability matrix of state transitions is used for the afore-mentioned purpose. 
In this case, you have to take into account that probabilities are calculated with a certain 
mistake. We cannot precisely compute the probabilities, since it is impossible to derive an 
infinite time series, and only a part of the time series is known - the known part serves as 
the basis for probabihties. The second important aspect implies the case of several states 
with maximal probability. 

To prevent the omission of the states, for which the probabilities are computed with a 
mistake, one should add a state with maximal probability to the states, which are located in 
the distance of 6 from the maximal one. The value of parameter 6 depends on the probability 
evaluation mistake and requires experimental refinement. 

U S > 0, the number of states with maximal probability increases in comparison to the 
value 5 — 0. Let us call a couple of neighbouring states with maximal probability a cluster. 
Cluster states with average deviation values are supposed to have the largest probability. 
To predict the dynamics, let us confine ourselves to one or two most probable states. To 
define them a following algorithm is suggested: 

1) If levels (discretized increments) create several chistcrs (cluster is a group of several 
neighbouring levels - cluster elements, minimal cluster is a single isolated level) with maximal 
probability, we choose the largest cluster. 

2) If the number of cluster elements is odd, as kmax we choose a central element. 

3) If the number of cluster elements is even, we consider two central cluster elements and 
choose as kmax the one, which is closer to the centre of distribution. 

4) If two central cluster elements are equidistant to the centre of distribution, we consider 
both cases as possible variants of kmax values (bifurcation point). 
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5) If there are several clusters of maximal size, we consider them as new elements, which 
can also form clusters that will undergo the same steps l)-4). 
This principle is based on the following ideas: 

1) If there are two neighbouring states of maximal probability, it is better to take the 
one, which is closer to the centre of distribution, in order to minimize the risk of occurrence 
of false linear trends in the prediction. 

2) If levels of maximal probability are not the neighbouring ones, at least two variants 
have to be considered, as it can be connected to the bifurcations that should not be omitted. 

3) If the prediction is carried out according to 1) (on all stages of the hierarchy), we 
receive a certain approximation of the lower limit of the prediction, whereas in case of 2) - 
we get an approximation of the upper limit. 

Hence this algorithm can adequately restore the case of possible bimodal probability 
distribution, it is proposed to consider 2 prediction scenarios. 

In case of the complex Markov chains, probability of the next state depends not only on 
the previous state, but also on the sequence of r states, which have occurred before given. 
In this case, it is necessary to calculate transition probabilities from the sequence of r states 
into the r + 1 state. Formally, these probabilities can be written into the rectangular table 
of (r*, s) size. 

Having generalized the notion of "present state" and included a sequence of r preceding 
states into it, we can reduce Markov chains of r order to the chain of the first order. Thus 
transition probabilities can be written into rectangular matrices of {r^,r^), that come as 
transition probability matrices for generalized states. 

The process of prediction implies the following: the last state is chosen (in case of Markov 
chains of an order r > 1 a sequence of r latest states is taken). The probability of transition 
from current state to all possible states is defined. From all possible states a state with 
maximal probability is chosen. It is possible that several states with maximal probability 
occur, which can be explained by the bimodal probability distribution. The process of 
decision-making in this case is described later. 

The chosen most probable state is taken as the next prognostic state and the procedure 
is repeated for the next (last added) state. Thus we receive a time series of prognostic states 
for the given sampling time At. 

Further according to the received state sequence and known initial value the time series is 
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being restored for the given time sampling At. In this case every state imphes At points of 
the time series. On the stage of state classification every state was connected to the average 
increment ravg,i, which is added to the value of the last point in the time series, and the 
next discrete point is computed. Intermediate points are filled as linear interpolation of two 
known neighbouring points. Algorithm of yt time series values restoration according to the 
initial price pt and a series of average increments ravg,ik, corresponding to the prognostic 
states Sk, can be given by a sequence of calculations: 



yt = Pt, 

yt+l ^yt + ravg,il/At ^Pt + ravg,il/At, 
yt+2 = yt+l + ravg,il/At =Pt + 2ravg,il/At, 

yt+At-1 = yt+At-2 + ravg,il/At = Pt + {At - l)ravg,i/t, 
yt+At = yt+At-1 + ravg,il/^t ^ Pt + Atravg,i/ At ^ Pt + ravg,il, 
yt+At+1 = yt+At + ravg,i2/^t ^ Pt + ravg,il + ravg,i2/ ^t, 

yt+nAt-1 — yt+nAt-2 i" ravg,in/ — Pt 'T 2^k=l ^avg,ik i ^avg,ik-i 
yt+nAt = yt+nAt-1 + ^avg,in/ At = Pt + ^fc=i Tavg,ik- 



(20) 



10. TIME INCREMENTS HIERARCHY AND SPLICING PROCEDURE 

Time series increments will be computed with different steps. For example, analogous 
to the discrete Fourier transform, time increments are equal the powers of 2 are considered. 
First, we calculate increments as a remainder of two nearest neighbouring time series values, 
then next nearest values are considered with the step of 2, 4, 8, 16 etc. Let us mark this 
difference in time as At. 

For every At we conduct an increment time series transformation leading to a time series 
of states. Further wc predict the future sequence of states and restore the time series with 
the given sampling rate according to the prognostic series of states. 

Time series, received as a result of restoring for different Ai, undergo the splicing proce- 
dure, which gives out an actual prognostic time series. 

Thus an increment hierarchy is chosen, where each one is responsible for its own sampling 
rate, which serves as a basis for predicting, restoring and splicing. 
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The splicing process implies the following. The procedure is iterative. With every next 
(along with the increasing step) sampling time the series corrects itself, driving the predic- 
tion, formed under lower At, to its actual point. Transformations that are conducted during 
splicing can be written down in the form of the following calculations. 

Suppose the splicing procedure has been finished for all time increments At < Deltati, 



the prediction has been done under the Atj sampling according to formulae 20, and as a 
result a time series i/i has been derived. Let us consider the iterative splicing procedure of 
the received series i/i with the series, acquired during all preceding splicing procedures gi. 

Since the series yi contains system points only in moments aliquot to Ati, and other 
points of the series are interpolated, the process of splicing implies the substitution of these 
interpolated points with the values of system points from previous At < Atj, which are 
contained in the series of results of previous splicing procedures Qi. Splicing algorithm can 
be written in the sequence of computations: 



zt = 9t= Pt, 

zt+i = gt+i + {yt+Au - 9t+Ati) /^U, 
zt+2 = gt+2 + 2 {yt+Au - 9t+Ati) /Ati, 

Zt+At-l = 9t+At~l + {Ati - 1) {yt+Au - 9t+Ati) / Ati, 
Zt+At = 9t+At + {Ati) {yt+ AU - 9t+Ati) /Ati = yt+Au, 
Zt+At+1 = 9t+At+l + {{yt+2Ati — 9t+2Ati) ~ {yt+Ati ~ 9t+Ati)) / ^ti, 
Zt+At+2 = 9t+At+2 + 2 {{yt+2Ati — 9t+2Ati) — {yt+Ati — 9t+Ati)) /Ati, 

Zt+nAt-1 = 9t+nAt-l + {{Vt-nAt — 9t~nAt) — {yt-{n-l)At — 9t-(n-l)At)) 

Zt+nAt = 9t+nAt + {{yt-nAt — 9t+nAt) — {yt+(n-l)At — 9t+{n-l)At)) = 
= 9t+(n-l)At ~ yt+{n~l)At ~ yt~nAt- 



(21) 
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11. RESULTS OF STOCK INDICES PREDICTION 

In this section we offer the results of stock indices prediction. The stock's indices 
databases are available from [SB]. Point 2000 indicates the starting moment of the prog- 
nosis: March 24, 2011. The green line on the next figures indicates real indice's or price's 
values. Our software for time series forecasting by the proposed methods is available from 



our website: http://kafek.at.ua/MarkovChainsl_2_20100505.rar 
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Figure 1: Stock indices prediction, a) Dow Jones Industrial Average - DJI (USA), b) 

FTSE 100 (Great Britain) 
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Figure 2: Financial companie's share prices forecasting, a) Morgan Stanley (USA), b) 

BNP Paribas (France) 
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Prediction time series with different input learning set's length are shown at the figjS] 
and |4} Prediction series for D Jl at the figure |3] are more correlated, than FTSE index at 
the figure |4j At the subplot b) of the above mentioned plots the mean value and standard 
deviations of the prediction's series are presented. The time of prediction series beginning 
on the next figures is the point 1000 and correspond to October 14, 2011. 
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Figure 3: Dow Jones Industrial Average - DJI (USA), a) Prediction series, calculated with 
different learning set's length, b) Mean value and standard deviation for prediction series. 
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Figure 4: FTSE 100 index prediction, a) Prediction series, calculated with different 
learning set's length, b) Mean value and standard deviation for prediction series. 
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The normalization procedure is proposed in order to compare indices and it's prediction 
series with different absolute values. The normalized values calculated with the following 
formula: 

y{t) — miniyit)) 

yn{t) = (,.y 22 

max [y[t)) — mtn {y{t)) 



Normalized prediction time series are shown at the figjS] (America), fig|6] (Europe, devel- 
oped countries), figjT] (Europe, PIIGS), fig|8] (Asian markets). All the figures contain mean 
time series, which are weighted average of countrie's stock indices predictions, weihted with 
GDP values [23 for the corresponding countries. 
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Figure 5: Normalized mean values for the prediction series of America's stock indices. 
Brazil (BVSP), Mexico (MXX), Canada (GSPTSE), Argentina (MERV), USA (S&P 500) 
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Figure 6: Normalized mean values for the prediction series of European 
stock indices. Developed countries: FTSE (Great Britain), DAX 
(Germany) FCHI (France), Netherlands (AEX) 
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Figure 7: Normalized mean values for the prediction series of European 
stock indices. Portugal (PSI20), Italy (FTSEMIB), Ireland (ISEQ), 
Greece (GD) and Spain (IBEX). 
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Figure 8: Normalized mean values for the prediction series of Asian 
stock indices. China (SSEC, HSI), Korea (KSll), Japan (NIKKEI), 
India (BSESN), New Zealand (NZ50). 
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Figure 9: Mean values of normalized World's powerful economies 
indices prediction series. 
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12. CONCLUSIONS AND FURTHER WORK 

Current paper suggests an algorithm of time series prediction based on complex Markov 
chains. Hierarchy of time increments principle allows to use the information, which is con- 
tained in the time series during the prognosis construction, to its fullest. Experimental work 
on stock market indices time series prediction shows the efficiency of the algorithm and 
confirms the relevance of further research of the offered method. 
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